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Abstract — We define capacity for group testing probiems and 
deduce bounds for the capacity of a variety of noisy models, based 
on the capacity of equivalent noisy communication channels. 
For noiseless adaptive group testing we prove an information- 
theoretic lower bound which tightens a bound of Chan et al. 
This can be combined with a performance analysis of a version 
of Hwang's adaptive group testing algorithm, in order to deduce 
the capacity of noiseless and erasure group testing models. 

I. Introduction and notation 

We consider the problem of noiseless group testing, as in- 
troduced by Dorfman JT1 in the context of testing populations 
of soldiers for syphilis. Group testing has recently been used 
to screen DNA samples for rare alleles by a pooling strategy, 
and relates to compressive sensing - see |2|-||4|. It also offers 
advantages for spectrum sharing in cognitive radio models (5). 

Suppose we have a population of N items, K of which 
are defective. At the ith stage, we pick a subset Si of the 
population, and test all the items in Si together. If Si contains 
at least one defective item, the test is said to be positive. If 5, 
contains no defective item, the test is said to be negative. 

The group testing problem requires us to infer which items 
are defective, using the smallest number of tests. We have 
the freedom to design the tests in a way that minimises the 
(expected) number of tests required. For simplicity, this paper 
will assume that the number of defectives K is known. 

The problem as described above is referred to as noiseless, 
since the test output is a known deterministic function of the 
items tested. In this case, positive and negative tests allow us 
to make complementary inferences. If the ith test is negative, 
we know that all the items in Si are not defective, so we 
need not test them again. If the ith test is positive, at least 
one item in Si is defective, but it will usually require further 
testing to discover which. We refer to an item which has not 
yet appeared in a negative test as a Possible Defective. 

It is natural to consider noisy models of group testing. 
For example, |6| considers an "additive model" (only false 
positives occur) where a negative test may erroneously be 
reported positive with probability p. The paper [7| considers 
a "symmetric error model" where the output of the tests are 
transmitted through memoryless binary symmetric channels 
with error probability p. Finally, we consider an "erasure 
model", where with probability p, the test simply fails, and 
returns an erasure symbol. 



In this paper we consider adaptive testing, where the test 
design depends on the outcome of previous tests, in contrast 
to the non-adaptive case, where the entire sequence of tests 
is specified in advance. In other contexts (see for example 
1 8 1), the ability to perform adaptive measurements can pro- 
vide significant improvements (for example, moving problems 
from polynomial accuracy to exponential accuracy). Perhaps 
surprisingly (see (51), in the context of group testing, the 
improvements are smaller, in that allowing adaptive algorithms 
only provides an improvement by a constant factor in the 



bounds (see for example Theorems 2.2 and 2.4 1. 

We follow the notation of Atia and Saligrama [6] who 
formulated group testing as a channel coding problem. We 
make analogies in this sense between the number of tests T 
required and the code block length, and between the number 
of possible defective sets (^) and the number of possible 
messages. This suggests that in the spirit of Shannon JTOI we 
should consider the rate as log 2 (^) /T. This prompts us to 
introduce the capacity of a group testing problem: 

Definition 1.1: Consider a sequence of group testing prob- 
lems, indexed by the number of items N = 1, 2, The Ath 

problem has K = K(N) defective items, where 1 < K < N. 
and T = T(N) tests are available. We refer to a constant C 
as the capacity if for any e > 0: 

1) any sequence of algorithms with 



lim inf ■ 

AT-s-oo 



log 2 



{k(n)) 



T(N) 



>C + e, 



(1) 



has success probability tending to 0, 
2) and there exists a sequence of algorithms with 



lim inf 

N— >oo 



lOg; 



N \ 
K(N)I 

T(N) 



>C 



(2) 



with success probability tending to 1. 

Malyutov [4, Section 2] gives a different definition of 
capacity, based on the defintion of the rate (in our notation) 
as (log N)/T for given K. Malyutov's survey paper [4] gives 
a historical perspective on problems of this kind. 

Group testing makes particular gains when the set of de- 
fectives is sparse, in that K = o(N). Otherwise, we could be 
accurate to within a multiple of the optimal answer simply by 
testing all items individually. For benchmarking purposes, it 



is plausible to use the parameterisation used by (8), (TT| and 
others, that is, to take K = N 1 ^ 13 for some < /3 < 1. The 
main result of the paper is the following: 

Theorem 1.2: The capacity of the adaptive noiseless group 
testing problem is C = 1, in any regime such that K = o(N). 

Theorem 1.2 is proved in Section [III] by combining existing 
performance guarantees (summarised in Section [nil with a new 
information theoretic lower bound, Theorem |3.1| 

Remark 1.3: As discussed below, our Theorem |3.1| im- 
proves on a result of Chan et al (7) - see ([6]) below - which 
does not give the capacity in the sense of Definition |1.1| [7] 
shows that if log 2 (^) /T > 1 + e, then the success probability 
is bounded above by 1/(1 + e), rather than tending to zero. 
To be precise, Definition 1.1 requires a strong converse (in 



Definition 
the sense discussed in 
converse. 



12 



|), whereas [7] only proves a weak 



It is natural to ask what is the capacity of noisy group 
testing problems under different noise models. The sequence 
of outcomes of tests can be considered as a binary codeword 
encoding the defective set. However, tests can only verify 
whether a subset intersects the defective set, whereas in a 
general communication channel the encoding being agreed by 
the receiver and the sender may incorporate more information. 
Hence we can state a new key principle: 

if it exists, the capacity of a noisy group testing prob- 
lem can never exceed the capacity of the equivalent 
communication channel. 

To derive these capacity upper bounds we must use the strong 



converse to Shannon's channel coding theorem [12]. We then 



2) 



3) 



state the following bounds on group testing capacities: 

Theorem 1.4: 1) In any regime with K = o(N), for the 
"erasure model" with probability p, C = 1 — p. 
For the "symmetric error model" with probability p of 
Q, if it exists C < 1 — h(p), with h the binary entropy. 
It is natural to conjecture C = 1 — h(p). 
For the "additive model" with probability p of (6J, if it 
exists C < log(l + (1 — p)p p ^ 1_p -*), the capacity of a 
Z-channel. Again we conjecture equality holds. 

Proof: In each case, the lower bound follows from the key 
principle above, using standard values of channel capacities. 
The exact value for[TJ is derived in Section [Hi] below. ■ 

The structure of the remainder of the paper is as follows. 
In Section [U] we review existing approaches to the group 
testing problem, describing upper bounds on the number of the 
tests required, including Hwang's Generalized Binary Splitting 
Algorithm. In Section III we prove lower bounds on the 



number of tests required, giving an upper bound on the success 



II. Existing performance guarantees 

We first describe some simple bounds on the performance of 
group testing algorithms. We will describe these as bounds on 
the number of tests required, though in Figure [T] such bounds 
are plotted in terms of success probabilities. Clearly, an upper 
bound on the number of tests required will relate to a lower 
bound on the success probability, and vice versa. 

Early approaches to the group testing problem involved 
deterministic (combinatorial) test designs, as reviewed in fl3) . 
In particular, at least il(K 2 log N / log K) tests are required for 
such designs in the non-adaptive case to guarantee success. 

More recent work has focussed on the use of randomised 
test designs. As a result, Atia and Saligrama |6) were able to 
show that 0(K\ogN) tests would suffice with high proba- 
bility, for adaptive group testing, even in the noisy case. This 
work was developed by Chan et al j7j, who provided explicit 
algorithms, and corresponding bounds on their performance, 
in the non-adaptive case. In a paper |14| being prepared in 
parallel to this one we strengthen some of the results of J7j. 

Performance of many adaptive algorithms is guaranteed by 
the following simple lemma: 

Lemma 2.1: Given a set of b items that is known to contain 
at least one defective item, label its elements 1, . . . , b and let 
L be the smallest label of a defective item in the set. Then 
in [log 2 b~\ tests we can discover with certainty that item L is 
defective and that items 1, . . . , L — 1 are all non-defective. 

Proof: Use the following recursive procedure, which we 
refer to as 'binary search'. If necessary, we add 'dummy items' 
to create a set of size 2 r iog2 b ^ . At each stage, given a set of 
size 5 which is guaranteed to contain a defective, we label 
its items with integers {1, 2, . . . , 5}. We test the items with 
labels {1,2,..., 5/2}. 

1) If the test is positive, we have a set of half the previous 
size, guaranteed to contain a defective. 

2) If the test is negative, we know that items {5/2 + 
1, . . . , 5} must contain a defective item. 

Each test therefore halves the size of the set, with it remain- 
ing guaranteed to contain a defective item. The property of 
finding the defective with the smallest label follows easily by 
induction and by the fact that at each step we always test items 
{1,2,..., 5/2} before {5/2 + 1, . . . , 5}, and discard the latter 
if the former is found to be positive. ■ 
A simple adaptive algorithm with guaranteed performance 
bounds is given by Repeated Binary Testing - see for example 
[13, p24-5]. The algorithm simply performs binary search on 
the set of size N, in order to find a defective item. This item is 
then removed from consideration, and the next round of testing 
carries out binary testing on a set of size N — 1. Repeatedly 



probability, and proving the capacity theorem, Theorem [O using Lemma [2T] it is clear that this algorithm provides a 



In Section IV we give simulation evidence which shows that 
these bounds are tight for a range of success probabilities, and 



prove Theorem 4.3 which shows we can improve on Hwang's 
algorithm, and can be adapted to suggest bounds on success 
probabilities in general. 



performance guarantee for adaptive testing. 

Theorem 2.2 ( 113]): Repeated Binary Testing is guaran- 
teed to succeed in K |~log 2 N~\ < K log 2 N + K tests. 

The Repeated Binary Testing algorithm is inefficient, in that 
each round of tests starts by testing large sets, which are very 



likely to contain at least one defective. In that sense, the early 
tests in each round are very uninformative. 

Hwang's Generalized Binary Splitting Algorithm fTB") is 
designed to overcome this problem. Hwang suggests testing 
groups of size 2 Q , where a is an integer chosen to ensure that 
the probability of the test being positive is close to 1/2. If the 
test is negative, all the items in it can be immediately classified 
as non-defective. If the test is positive, it must contain a 
defective, which can be found in a tests using the binary 



search procedure of Lemma 2.1 above. 

Using this procedure, Hwang [15, Theorem 1] deduces an 
upper bound on the number of tests required, which further 



analysis (see 1 1 3 Corollary 2.2.2]) shows is close to optimal 



in the sense discussed in the proof of Theorem 1.2 



Theorem 2.3 ( [13]): Given a problem with A" defectives in 
a population of size N, Hwang's adaptive Generalized Binary 
Splitting Algorithm is guaranteed to succeed using 



T = log 2 



K tests. 



(3) 



In order to see the gain due to adaptivity, this bound can be 
compared with the following result, Theorem 4 of J7J. 

Theorem 2.4 ( [7]): The Combinatorial Orthogonal Match- 
ing Pursuit (COMP) algorithm recovers the defective set with 
error probability < N~ s , using T = (2(1 + 8)e)K\nN tests. 



Theorems 2.3 and 2.4 can be compared using well-known 
bounds on the binomial coefficients, that is for all K and N: 



K\og 2 (N/K)<\og 2 



<Klog 2 (Ne/K), (4) 



In the regime K(N) = N 1 -? considered by |[sj , (Tl], we can 
see that Q and Theorem |2.3| can be combined to show that 
Hwang's adaptive algorithm is guaranteed to succeed in 

(3K \og 2 N + K(\og 2 e + 1) 

tests. In contrast, Theorem |2.4| shows that the (non-adaptive) 
COMP algorithm [7] requires 

'2(l + (5)e s 
log 2 e 

showing that adaptivity offers asymptotic gains over the 



!Hog 2 N w 3.76(1 + 5)K\og 2 N, 



bounds of Theorem 2.4 which are greatest when /3 is small - 
which is when there are relatively many defectives. 

III. New information-theoretic lower bound 

We can contrast the performance guarantees of Section III] 
with a new information-theoretic lower bound, Theorem |3.1| 
Note that Theorem |3.1| applies to both adaptive and non- 
adaptive group testing. It is possible that a tighter lower bound 
could be found if we restrict ourselves to non-adaptive tests, 
but we are not aware of such a result. 

Theorem 3.1: Consider testing a set of N items with K 
defectives. Any algorithm to recover the defective set with T 
tests has success probability P(suc) satisfying 

P(suc) < (5) 



Proof: Given a population of N objects, we write Sjv.a' 
for the collection of subsets of size K from the population. 
Further, we write T> for the true defective set. 

The testing procedure naturally defines a mapping 8 : 
^n,k — > {0, 1} T . That is, given a putative defective set 
S € ^n.k, write 8(S) to be the vector of test outcomes, with 
positive tests represented as Is and negative ones represented 
as 0s. For each vector y 6 {0, 1} T , write A y C Y,n,k for the 
inverse image of y under 8, 

A = _1 (y) = {S e ^n,k : 9(S) = y} , 

and write A y — \A y \ for the size of A y . 

The role of an algorithm which decodes the outcome of the 
tests is to mimic the effect of the inverse image map d^ 1 . 
Given a test output y, the optimal decoding algorithm would 
use a lookup table to find the inverse image A y . If this inverse 
image A y = {S} has size A y = 1, we can be certain that the 
defective set was S. In general, if size A y > 1, we cannot 
do better than to pick uniformly among A y , with success 
probability 1/A y . (We can ignore empty A y , since we are 
only concerned with vectors y which occur as a test output). 

Hence overall, the probability of recovering a defective set 
S is 1/Aqis), depending only on 8{S). We can write the 
following expression for the success probability, conditioning 
over all the equiprobable values of the defective set: 

(T) 



(sue) = 

1 



(sue i v = sy 



E W)=y)F(suc|P = 5) 

\Kl 



SGE N ,jf yG{0,l} T 

\k) se^N.K ye{o,i} T :^ y >i 



l(8(S) 



1 

] A V 



— y - 

(N\ a 
^ K > ye{0,l} T :.4 y >l y 

^ K > ye{0,l} T :A y >l y 

|ye{0,i} T :^ y >i| < 2 T 

(N\ — /N\ 



E ms) = y) 



since {0, 1} T , a set of size 2 T . ■ 
The fact that log 2 (^) is the 'magic number' of tests 
providing a lower bound on the number of tests required 
for recovery with success probability 1 is folklore - see for 
example |15|. However, the exponential scaling of success 
probability for lower numbers of tests which we provide here 
is new. Theorem |3.1| is a strengthening of Theorem 1 of (7), 
which implies that 

P(suc) < T (6) 

l0 §2 U) 

In fact, Theorem 1 of |7) is stated with K\og 2 {N/K) in the 
denominator - the stronger form given by |6]l is given within 
their proof. The differing form of <|5j and |6]l is plotted in 



Success rate in <=T tests 



K=10, N=500 



K=30, N=9699 




Fig. 1, Plot of success probability of Hwang's group testing algorithm for 
(K,N) = (10,500) and (30,9699). The simulated success probability of 
Hwang's algorithm is plotted as a bright green line, and the related algorithm 
analysed in Section [TV] is plotted in dark green. The upper bound on success 
probabilityof Theorem |3.1| is plotted in red, and the upper bound of [7] 
Equation |6l in blue. The dotted vertical line is at log 2 ( K )- 



IV. A TIGHTER UPPER BOUND 

We briefly illustrate the performance of Hwang's algorithm 
by simulation. For example, in the two cases of Figure [T] 
we keep the parameter /3 = 0.63 fixed and plot the success 
probability of the algorithm compared with the lower bound 
of Theorem |3.1| for different problem sizes. Notice that the 
(bright green) simulated success curve of Hwang's algorithm 
is very close to the (red) theoretical upper bound on success 
probability (lower bound on tests required). The dark green 
line corresponds to the performance of the same algorithm 
using the sample size discussed in Section |IV] The dotted 
vertical line is at log 2 (^), close to the rapid transition of 
success probability. 

Figure [T] clearly shows that the bound of Theorem 3.1 



Figure [T] emphasising the point made in Remark 1.3 that ([5]) 
is significantly stronger. 

Observe that (using the fact that for any random variable 
ET = J2Z (1~P(T < *)), (§) implies that for any algorithm allows ufJ t0 k ^ti- ack of the size of each subproblem (round). 



is close to the true performance of the algorithm in these 
cases. We therefore prove Theorem |4.3| an upper bound on 
the performance of a version of Hwang's algorithm, which 
reduces the gap between the upper bound of Theorem |2.3| and 
the information-theoretic lower bound, from K to under K/2. 
We group the tests in K rounds, each of which comprises 
a sequence of negative tests before a positive test. Once a 
positive test occurs, we can find a defective using binary search 



as in Lemma 2.1 We introduce the following notation, which 



that uses a random number of tests T to detect the defective 
set with certainty, the expected success time 



ET > log 2 



- 2. 



(i) 

Definition 4.1: We write ivj the number of Possible De- 
fectives left after the j-th consecutive negative test of the z'-th 
round and for the number of Possible Defectives at the 
(7) start of the z-th round, and likewise we write — K — i + J 



We now prove the main result of the paper, Theorem 1.2 

Proof of Theorem \1.2\ The result is obtained using the 
binomial coefficient bounds Q, with the lower bound meaning 
that in the regime K — o(N), then 



lim 

N-HX 



!og 2 ( 



N \ 
K(N)> 



for the number of defectives in the z'-th round. We write Sj 
for the indicator that the j-th test of the z'-th round is positive. 
Finally define Tj as the random time of the last negative test 
of the round. 

After every negative test the set of Possible Defectives will 
be reduced. In particular, if we denote by of' the size of the 



K{N) 



oo, 



(8) 



j-th test in the z'-th round, then 



since we also have K > 1. Now fix e > 0. First if, as assumed 
in |TJ for N sufficiently large, T(N) < log 2 ( K f N) ) then 



1 < i < K, 0<j<Ti 



(9) 



Theorem J^Tj shows that 
the strong converse, i.e. that 



sue) < ( K ^v)) € ^ 1+€ \ We deduce 
sue) tends to zero by (|8j. 
Theorem 2.3 shows that Hwang's Generalized Binary Split- 
ting Algorithm is guaranteed to succeed using T(N) = 
log 2 ( K f N s) + K(N) tests. We can deduce by mi that 
log 2 ( K ( N) ) /T(N) is greater than 1 - e, for all N sufficiently 



As described previously, we use maximally informative tests 
by making the probability of observing a negative test (just) 
less than 1/2. By truncating binomial coefficients, we prove: 
Lemma 4.2: Conditional on having observed j — 1 consec- 
utive negative tests in the z-th round of Hwang's algorithm, 
any random sample of size 



large, so d2j) follows with C = 1. 



bf > Nf\ 



1 - T 1 '^ 



(10) 



Proof of Theorem 1.4 



Extending this, we deduce the 
capacity of the erasure model for adaptive group testing in 
the K = o(N) regime is exactly 1 — p. We simply repeat 
any erased test until erasure fails to happen and then use 
Hwang's algorithm. We need to have a number of non-erased 
tests greater than the bound of Theorem |2.3| The probability 
that this doesn't happen goes to exponentially, hence with 
log (^) /(1—p—s) tests the probability of success approaches 
1 exponentially fast. ■ 



has probability less than 1/2 of being negative, that is 
P{sf = | S$ = 0, . . . , S^2i = 0) < 1/2. 



r(0 



If the test is 



If the test is negative, we update Nj using 
positive, the only upper bound we can derive for sure for the 



new size of the set of possible defectives is < iV-jlj — 1, 
as we're only sure that one (defective) item will be removed 
from the set. By induction, within each round the following 



formulae for b[ 1 ^ and hold: 



3 

j-i r 



2 jcW 



iv(i-2- 1 / K<I) )-( J ft:«-i) 



A/ 



2 k(0 A# } + 



(If* 4 



3-1 
h=0 



(11) 

(12) 



Here TVq ; clearly depends on the previous round; it equals N 
for i = 1 and it can be suitably upper bounded for i > 2. With 
these results we can produce an upper bound on the average 
number of tests for this version of Hwang's algorithm. 

Theorem 4.3: Our version of Hwang's algorithm is guaran- 
teed to succeed with number of tests satisfying: 

T tot < K\ogN+ (1 + log In 2) AT - log if! + R . (13) 



where R is a negative random term. 
Proof: We observe that 



N^, ( L 



-1/K& 



< 2N, 



(0 



3-1 



-1/ifW 



(14) 



Similarly, repeatedly substituting ( 10 1 in ([9]) we obtain 

N (i) < N «) 2 -i/K W . (15) 

This can be plugged back into l ] 14} , giving 



1.(0 



. 2iV (i) 2-( J '- 1 ^' pfCi) (l - 2- i/ftV ") 



(16) 



We can then compute the upper bound on the total number of 
tests using Lemma |2.1| The number of tests in the ith round 
satisfies: 

Ttot =Ti + log <T % + 1 + log 



Tj 



+ log(l-2- 1 /^ ) ) . (17) 



In order to obtain a bound on the total number of tests we 

(i) (i— 1) 

have to relate TVg ' with Nj, ' . Recalling Lemma 



2.1 



lettmost 



define Li by making 1 + Lj be the position of the 
defective in the (Tj + l)-th sample, thus obtaining the equality 



N, 



(i+l) 



N, 



(i) 



Ti 



(18) 



Using iteratively the update formulae < Nq2 H k ' 



and ( pLSj t, we can deduce that log A^ 



(i+l) 



logiV 



E 

3=1 



X(3) 



log 



equals 



N ^ W h 2 



-Tj/KW 



Summing together the bounds for the number of tests in each 
round and calling R = J2i=i @i> we obtain: 



K 



Ttot < K log N + K + lo g(l - 2" 



i?. (19) 



To gain a more manageable expression, we bound the penul- 
timate term of (19i. Calling f(x) := 2 _2: — 1 + xln2, notice 
that f'(x) > for x > and /(0) = 0; in particular, 

1 - 2- 1 /" 1 < 1 In 2. Substituting = K-i+1, reversing 

i 

the order of the sum and taking logs, we deduce the result. ■ 



It is hard to say more about the random terms Cj than 
that they are negative, though we hope that future simulation 
and probabilistic bounds will give us further insights into the 
resulting number of tests required. Notice the slightly suprising 
feature that the number of negative tests T, has no effect on 



the final bound (19 1. This can be explained by the fact that 
summing ( 17 1 over i creates a double sum over i and j, and 



that the coefficients of Tj exactly cancel. 

V. Conclusion 

Using a sharper information-theoretic lower bound, we have 
shown that in the noiseless adaptive case, the capacity of group 
testing is 1, and that for an erasure channel, the capacity is 
1 — p. For other noise models, we have found lower bounds 
on the capacity. It remains of interest to find exact values of 
the capacity in other cases, including non-adaptive problems. 
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